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Abstract: DNA methylation of promoters is linked to transcriptional silencing of 
protein-coding genes, and its alteration plays important roles in cancer formation. For 
example, hypermethylation of tumor suppressor genes has been seen in some cancers. 
Alteration of methylation in the promoters of microRNAs (miRNAs) has also been linked 
to transcriptional changes in cancers; however, no systematic studies of methylation and 



Int. J. Mol ScL 2012, 13 



8260 



transcription of miRNAs have been reported. In the present study, to clarify the relation 
between DNA methylation and transcription of miRNAs, next-generation sequencing and 
microarrays were used to analyze the methylation and expression of miRNAs, 
protein-coding genes, other non-coding RNAs (ncRNAs), and pseudogenes in the human 
breast cancer cell lines MCF7 and the adriamycin (ADR) resistant cell line MCF7/ADR. 
DNA methylation in the proximal promoter of miRNAs is tightly linked to transcriptional 
silencing, as it is with protein-coding genes. In protein-coding genes, highly expressed 
genes have CpG-rich proximal promoters whereas weakly expressed genes do not. 
This is only rarely observed in other gene categories, including miRNAs. The present 
study highlights the epigenetic similarities and differences between miRNA and 
protein-coding genes. 

Keywords: DNA methylation; microRNA; cancer 



1. Introduction 

DNA methylation plays important roles in development, differentiation, X inactivation, genomic 
imprinting, and silencing of transposable elements through the regulation of transcription [1-3]. This 
usually occurs in mammalian cells at cytosine residues in the context of cytosine-phosphate-guanine 
dinucleotide (CpG), and approximately 60%-90% of cytosines at these sites are methylated [4]. 
Methyl-CpG serves as the physiological ligand for a family of proteins containing a highly conserved 
methyl-CpG binding domain (MBD) [5]. These proteins recruit various chromatin-modifying complexes 
to methyl-CpG sites to cause further chromatin structural changes that result in transcriptional silencing. 

Alterations of the patterns of normal DNA methylation result in many human diseases, including 
cancer [6]. Aberrant patterns of DNA methylation in cancers are associated with tumor type, stage, 
prognosis, and response to chemotherapy. Human tumors undergo global DNA demethylation, 
including of Line- 1 repetitive elements, and then DNA hypermethylation of certain gene promoters, 
including those of tumor suppressor genes [7]. Aberrant DNA methylation in cancers is not restricted 
to protein-coding genes; it is also observed in microRNA (miRNA) genes. MicroRNAs are small 
noncoding regulatory RNAs of 20-24 nucleotides that reduce the stability and/or translation of fully or 
partially sequence-complementary target mRNAs. MicroRNAs can act as oncogenes or tumor 
suppressors, and can contribute to cancer initiation and progression [8]. 

Genome-wide analysis of DNA methylation is an important issue in epigenetic research. The oldest 
technology for genome-wide analysis of DNA methylation, Restriction Landmark Genomic Scanning, 
which utilizes two-dimensional gel electrophoresis of genomic DNA, was developed in 1991 [9,10]. 
This technology has been used in the cloning of many imprinted genes [10] and tumor 
suppressors [11,12]. In this century, development of genome-wide analysis technology such as 
microarrays and next-generation sequencers brought about several new methods for analyzing 
DNA methylation [13]. Among these methods, capture and next-generation sequencing of methyl-CpG 
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binding domains of MBDl protein is especially useful for analysis because the binding activity of this 
protein is stronger than that of antibodies [14]. 

Although epigenetic silencing of miRNAs has been reported in many cancers, systematic studies of 
DNA methylation and miRNA transcription have not yet been reported. In addition, differentially 
methylated regions were found only in the CpG islands just upstream of miRNAs in most reports on 
the epigenetic silencing of miRNAs because such cases are easily discovered. Therefore, it has not 
been clarified whether the transcriptional start sites far upstream of miRNAs are influenced by 
DNA methylation or not. It has also not been clarified whether the CpG-poor promoters of miRNA are 
influenced by DNA methylation. Furthermore, miRNAs are not only transcribed by RNA 
polymerase II, which is responsible for most protein-coding genes, but also transcribed by RNA 
polymerase III. Although the relation between DNA methylation and silencing has been extensively 
studied in genes transcribed by RNA polymerase II, there are few reports for genes transcribed by 
RNA polymerase III. Therefore, systematic analyses of the methylation and expression of miRNAs are 
required. To clarify the relation between DNA methylation and transcription of miRNAs, here we 
systematically analyzed the methylation and expression of human genes encoding miRNAs, 
proteins, other non-coding RNAs (ncRNAs), and pseudogenes using next-generation sequencing and 
microarray analysis. 

2. Results and Discussion 

2.1. Genome-Wide Identification of Methylated DNA 

To study the genome-wide methylation signature, we performed massive sequencing of methylated 
DNA enriched by the MBD domain of MBDl, using an lUumina sequencer. We expressed a 
His-tagged fragment of MBDl (aa 1-75) in bacteria. This fragment contains the critical MBD domain 
required for stable and selective binding to methyl-CpG but no structural elements known to contribute 
to sequence-specific DNA binding [15]. Randomly shared methylated genomic DNA (about 300 bp) 
was bound to His- and GST-tagged fragments of MBDl and collected on Dynabeads Talon Magnetic 
beads, which bind to the His tag. The collected DNA was then purified and sequenced using a 
next-generation sequencer (lUumina). We performed the genome-wide methylation analysis on a 
human breast cancer-derived cell line, MCF7, and an adriamycin resistant cancer cell line 
MCF7/ADR [16]. We obtained 19 million single-end reads for MCF7 and MCF7/ADR with high 
quality read placement against the human genome. 

To determine the reliability of our genome-wide analysis of DNA methylation, the methylation 
signature attained in our analysis was confirmed using previous reports. Several genes were reported to 
be differentially methylated between MCF7 and MCF7/ADR. For example, ABCBl (MDRl), 
encoding P-glycoprotein, which is capable of mediating resistance to many antineoplastic drugs 
commonly used to treat breast cancer by acting as an efflux pump, is hypermethylated in MCF7 but 
demethylated and overexpressed in drug-resistant MCF7/ADR [17]. We confirmed this differential 
methylation between MCF7 and MCF7/ADR in our analyzed data (Figure la). The silencing of CDHl, 
which encodes E-cadherin, is important in the epithelial-to-mesenchymal transition. It has been 
reported by others [18] and confirmed by our results that CDHl is unmethylated in MCF7 but 
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hypermethylated in MCF7/ADR (Figure lb). Differential methylation of TGM2 [19], a potential 
molecular marker for chemotherapeutic drug sensitivity, was also confirmed (Figure Ic). 



Figure 1. Representative methyl-CpG binding domain (MBD)IDIP-Seq profiles of 
previously reported differentially methylated promoters between MCF7 and 
MCF7/adriamycin (ADR). Y axis represents the methylation levels of each cell line. 
Arrows and green bars denote transcription start sites and cytosine-phosphate-guanine 
dinucleotide (CpG) islands, respectively, (a) P-glycoprotein (ABCBl) locus; 
(b) E-cadherin (CDHl) locus; (c) transglutaminase 2 gene (TGM2) locus. 
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Next, we compared the methylation around the miRNA promoters between MCF7 and MCF7/ADR 
and pick up the differentially methylated regions. By confirming these differentially methylated 
regions using bisulfite sequencing, we tested the reliability of our analysis. Combining nucleosome 
mapping with chromatin signatures of promoters, 157 proximal promoters of human miRNA [20] were 
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identified and used for the analysis. We found several miRNA clusters, such as miR-200c/141 and 
miR-200ab/429, which were differentially methylated between these cell lines. For example, the 
proximal promoter of the miR-200c/141 cluster was hypermethylated in drug-resistant MCF7/ADR 
(Figure 2a), a finding confirmed by bisulfite sequencing (Figure 2b). In addition, the expression of 
these miRNAs was downregulated in MCF7/ADR (Figure 2c). Silencing of miR-200 families is 
important for the maintenance of breast cancer stem cells [21]. This family is also important for the 
regulation of the epithelial to mesenchymal transition [22] and drug-resistance [23]. The proximal 
promoter of the miR-200ab/429 cluster was also hypermethylated in MCF7/ADR (Figure 2d), and this 
was also confirmed to be silenced (data not shown). These results confirmed the reliability of our 
MBDl -based DNA methylation analysis. 

Figure 2. Representative MBDIDIP-Seq profiles of differentially methylated miRNA 
promoters between MCF7 and MCF7/ADR. Arrows and green bars denote transcription 
start sites and CpG islands, respectively. Red and magenta blocks indicate pre-miRNAs 
and scaRNAs, respectively, (a) Methylation of the miR-200c/141 locus; (b) Bisulfite 
sequencing results for the differentially methylated promoter of miR-200c/141. The 
methylation in the region corresponding to the black bar just upstream of the 
miR-200c/141 transcription start site in (a) is presented. Open circles and closed circles 
denote unmethylated and methylated CpG sites, respectively; (c) Real-time-PCR analysis 
of miR-200c and miR-141 expression. Expression level in MCF7 is normalized as 100%; 
(d) Methylation of the mir-200ab/429 locus. 
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Figure 2. Cont. 



MCF7 



MCF7/ 
ADR 



oooooooooooooooooooooooooo 
oooooooooooooooooooooooooo 
oooooooooooooooooooooooooo 
oooooooooooooooooooooooooo 
oooooooooooooooooooooooooo 
oooooooooooooooooooooooooo 
00 oooooooooooo oooooooooooo 
oooooooooooooooooooooooooo 
oooooooooooooooooooooooooo 
oooooooooooooooooooooooooo 
oooooooooooooooooooooooooo 
oo tooooooooooooooooooooooo 
00 oooooooooooo oooooooooooo 
oooooooooooooooooooooooooo 
oooooooooooooooooooooooooo 
oo ooooo oo ooooo ooooo ooooooo 



140 

C5^120 

^100 

•2 80 

^ 60 
Oh 

X 40 

w 

20 






miR-200c 
















































=^ 














0 




MCF7 


MCF7/ADR 



miR-141 



MCF7 



MCF7/ADR 



miR-200b 
miR-200a 
miR-429 
CpG islands 



(b) 



(c) 



I I 



% ^ 

1,087,0901 {,m,m\ \,mM^\ \,m,\ 




1 hSl8 

i,e92,e«0| 1,093,0001 i,m,m\ \,m,{ 

iiiiiiiiiiiiiinii 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




lIlllllllihMllllllllMi 

CpQ lilffrii <liian:ii < 388 lm% m Lssht Ortin) 



I 



(d) 



2.2. DNA Methylation versus Transcription 

To explore the relationship between gene expression and DNA methylation in the proximal 
promoter and elsewhere in the gene, we performed microarray analysis using the Agilent platform for 
miRNA and ordinary genes in MCF7. We were able to use 157 miRNAs, 600 RefSeq non-coding 
RNAs (ncRNAs) and 235 RefSeq pseudogenes for both expression and methylation analysis. We also 
used 1000 randomly selected RefSeq protein-coding genes. Before analysis, we plotted the 
CpG density of each gene category against the position in the genes, since CpG density is an important 
promoter characteristic. For genes of all categories, the highest CpG density was at the transcription 
start site, and this is characteristic of promoters (Figure 3). miRNA and protein-coding genes had 
higher CpG density at the transcription start site compared with other non-coding RNA genes and 
pseudogenes (Figure 3). We split the genes into two groups: "highly expressed" and "weakly 
expressed" genes. Highly expressed and weakly expressed genes were defined as those falling within 
the highest 20% expression quantile and the lowest 20% expression quantile, respectively. For each 
group, we plotted average cytosine methylation against gene position (Figure 4). This analysis was 
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performed for each category of gene: miRNA, protein-coding, other non-coding RNA, and pseudogene. 
In the highly expressed genes, we observed low methylation in the proximal promoters for both 
miRNAs and protein-coding genes (Figure 4). However, low methylation in the highly expressed 
genes was not observed in the proximal promoters for the other non-coding RNAs and pseudogenes 
(Figure 4). Thus, DNA methylation in the proximal promoter of miRNAs is tightly linked to 
transcriptional silencing, as is the case with protein-coding genes. 

Figure 3. CpG density around the transcription start site for each gene category. The 
average CpG density is plotted against distance from transcription site. 
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Figure 4. Methylation of weakly expressed and highly expressed genes. The average 
methylation for each gene category is plotted against distance from transcription start site. 
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In addition to proximal promoter methylation, the role of non-promoter methylation, such as in 
enhancers/far-upstream elements and within the body of the gene, was also examined. Methylation was 
seen in the gene body and far upstream region, with highly expressed genes showing more extensive 
methylation compared with weakly expressed genes in all categories, including miRNAs, 
protein-coding, other non-coding RNAs, and pseudogenes (Figure 4). Thus, non-promoter methylation 
of miRNAs is linked to transcriptional activation, as with other genes. In mammals, gene-body 
methylation has been observed in the active human X chromosome when compared with its inactive 
counterpart [24]. Genome-wide analysis of postnatal neural stem cells indicates that DnmtSa occupies 
and methylates non-promoter regions flanking proximal promoters of a large cohort of 
transcriptionally active genes, many of which encode regulators of neurogenesis [25]. 
Dnmt3a-dependent nonproximal promoter methylation promotes expression of these neurogenic genes 
by functionally antagonizing Polycomb repression. 

Thus, our genome-wide analysis demonstrated for the first time that the relation between DNA 
methylation and transcription of miRNAs are similar to that of protein-coding genes although some of 
miRNAs are transcribed by RNA polymerase III. In agreement with our results, U6 snRNA family 
genes, which are transcribed by RNA polymerase III, have been reported to be regulated by DNA 
methylation recently [26]. Increased expression of RNA polymerase III products is often observed in 
transformed cells [27]. It is also suggested that RNA polymerase III output can substantially affect 
transformation [27]. Therefore, transcriptional regulation of miRNAs transcribed by RNA polymerase 
III might play an important role in transformation. 

2,3, CpG Density YQYSUS Transcription 

CpG density underlies many characteristics of promoters. Ubiquitously expressed genes have 
CpG-rich promoters, while tissue-specific genes have CpG-poor promoters [28]. Therefore, it is 
possible that CpG density can determine the promoter activity of the genes. We plotted the CpG 
density of highly expressed and weakly expressed genes in each gene category against the position 
from the transcription start site (Figure 5). In protein-coding genes, highly expressed genes had higher 
CpG density in the promoter, while weakly expressed genes had lower CpG density (Figure 5). 
In other words, ubiquitously expressed genes have stronger promoter activity compared with 
tissue-specific genes. This characteristic is only weakly observed in gene categories other than 
protein-coding genes (Figure 5). Therefore, this correlation between CpG density and promoter 
activity could be related to protein-coding ability. Protein-coding genes are transcribed by RNA 
polymerase II, while genes coding small non-coding RNAs including some miRNAs are transcribed by 
RNA polymerase III. It is intriguing to think that the transcription by RNA polymerase II is affected by 
CpG density, while the transcription by RNA polymerase III is not. Non-coding RNAs including small 
RNA become important in cancer research. Further research is required to explain the relation between 
CpG density and promoter activity. 
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Figure 5. CpG density of weakly expressed and highly expressed genes. The average 
CpG density of each gene category is plotted against distance from transcription site. 
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3. Experimental Section 

3.1. Cell Lines and Culture Conditions 

The human breast cancer-derived cell line, MCF7, and its multi-drug resistant derivative, 
MCF7/ADR, were cultured in RPMI1640 medium containing 10% FBS. 

3.2. Isolation ofMethyl-DNA by MBDl 

Ten micrograms of genomic DNA were sonicated to 100-500 bp using a Bioruptor UCD-250 
(Cosmo Bio, Tokyo) sonicator. The capture reaction was performed by adding 10 mg of sonicated 
DNA to a mixture of 10 micrograms of His-tagged MBDl fragment (aa 1-75) and 60 |iL of 
Dynabeads TALON in MBDl buffer (10 mM Tris, pH 7.5, 160 mM NaCl, 0.1% Tween20) on a 
rotating mixer, overnight, at 4 °C. The beads were washed four times with MBDl buffer. 
The bound methylated DNA was then eluted by digestion with Proteinase K in 50 mM Tris (pH 7.5), 
10 mM EDTA, 0.5% SDS, at 50 °C for 3 h. Eluted DNA was purified with MinElute spin 
columns (QIAGEN). 

3.3. Illumina Genome Analyzer Library Preparation and Sequencing 

A single-end library was made following the modified manufacturer's protocol with reagents 
supplied in the Illumina DNA sample kit. Briefly, collected methylated DNA was end-repaired using 
Klenow and T4 DNA polymerases, phosphorylated with T4 polynucleotide kinase, and adenylated 
using Klenow exo-DNA polymerase, and oligonucleotide adapters were added using DNA ligase. The 
ligated product was amplified using adapter- specific primers 1.1 and 2.1 with Phusion DNA 
polymerase using the following protocol: 

98 °C (0:30) + [98 °C (0:10) + 65 °C (0:30) + 72 °C (0:30)] x 10 cycles + 72 °C (5:00) 

Amplified DNA was visualized in a 5% Acrylamide TBE gel, and a 300 to 400 bp size range was 
excised and purified using SPIN-X cartridges (Costar) and MinElute spin columns (QIAGEN). 
A single-fiow cell lane was sequenced in an Alumina Genome Analyzer. The reds were mapped to the 
reference human genome sequences (hgl8). We normalized the tag counts for each region to tags 
per million (ppm) in a window size of 200 bases, and this value was represented as methylation level. 

3.4. Bisulfite Sequencing 

One microgram of genomic DNA was bisulfite-treated with the EpiTect Bisulfite kit (QIAGEN) 
according to the manufacturer's protocol. PGR amplification was performed on bisulfite-modified 
DNA using the following primers: TTTATGGTAGGAGGATATATTTGTG and 
CACCTTAAATCAAACAACTTCAAAC. The resulting PGR product was cloned into a pGR2.1 
(Invitrogen) TA-cloning vector and was sequenced using a 3 130 DNA sequencer (Applied Biosystems). 
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3.5. Real-Time PCR Analysis ofmiRNA Expression 

Expression of mature miRNAs (miR-200c and miR-141) in human breast cancer cell lines was 
analyzed by TaqMan miRNA Assay (Applied Biosystems) under conditions defined by the supplier. 

3.6. Genome-Wide Gene Expression Analysis 

Expression analysis of miRNA and the other genes was carried out using Agilent's 
microarray-based miRNA and mRNA platforms, respectively. Labeling and hybridization was 
performed according to the manufacturer's protocol (Agilent Technologies). 

4. Conclusions 

In this study, we systematically analyzed the methylation and expression of human miRNAs in 
breast cancer cell lines and compared these levels with those of other gene categories using 
next-generation sequencing and microarray analysis. We found that DNA methylation in the proximal 
promoters of miRNAs and protein-coding genes was tightly linked to transcriptional silencing. 
Expression analysis revealed a correlation between expression level and CpG density of proximal 
promoters in protein-coding genes; however, this was only weakly observed in miRNA genes. 
Our observations highlight the epigenetic similarities and differences between miRNA and 
protein-coding genes. 
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